gender recognition
Exploring the Feasibility of Deep Learning Techniques for Accurate Gender Classification from Eye Images
Hasan, Basna Mohammed Salih, Mstafa, Ramadhan J.
Gender classification has emerged as a crucial aspect of various fields, including security, human-machine interaction, surveillance, and advertising. Its accuracy, however, can be affected by factors such as cosmetics and disguise. Our study addresses this concern by concentrating on gender classification from color images of the periocular region, the area surrounding the eye that includes the eyelids, eyebrows, and the region between them. This region contains valuable visual cues from which key features for gender classification can be extracted. This paper introduces a Convolutional Neural Network (CNN) model that uses color image databases to evaluate the effectiveness of the periocular region for gender classification. To validate the model's performance, we tested it on two eye datasets, CVBL and (Female and Male). The proposed architecture achieved an outstanding accuracy of 99% on the previously unused CVBL dataset and a commendable 96% on the (Female and Male) dataset with a small number of learnable parameters (7,235,089). To ascertain the effectiveness of the proposed model for gender classification using the periocular region, we evaluated it on an extensive range of metrics and compared it with other state-of-the-art approaches. The results demonstrate the efficacy of our model and suggest its potential for practical application in domains such as security and surveillance.
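The abstract above reports a learnable-parameter count of 7,235,089; the sketch below shows how such counts are tallied layer by layer for a CNN. The layer shapes are hypothetical stand-ins for illustration, not the paper's actual architecture.

```python
# Sketch: how learnable-parameter counts for a CNN are tallied.
# The layer shapes below are hypothetical, not the paper's architecture.

def conv2d_params(k_h, k_w, c_in, c_out, bias=True):
    """Conv layer: k_h * k_w * c_in * c_out weights, plus one bias per output channel."""
    return k_h * k_w * c_in * c_out + (c_out if bias else 0)

def dense_params(n_in, n_out, bias=True):
    """Fully connected layer: weight matrix plus optional bias vector."""
    return n_in * n_out + (n_out if bias else 0)

# Hypothetical stack: three conv blocks on a 3-channel input, then a
# binary (male/female) classification head on a flattened 8x8 feature map.
total = (
    conv2d_params(3, 3, 3, 32)        # 896
    + conv2d_params(3, 3, 32, 64)     # 18,496
    + conv2d_params(3, 3, 64, 128)    # 73,856
    + dense_params(128 * 8 * 8, 256)  # 2,097,408
    + dense_params(256, 2)            # 514
)
print(total)  # 2,191,170 for this toy stack
```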
Beyond the binary: Limitations and possibilities of gender-related speech technology research
Sanchez, Ariadna, Ross, Alice, Markl, Nina
This paper presents a review of 107 research papers relating to speech and sex or gender in ISCA Interspeech publications between 2013 and 2023. We note the scarcity of work on this topic and find that terminology, particularly the word gender, is used in ways that are underspecified and often out of step with the prevailing view in social sciences that gender is socially constructed and is a spectrum as opposed to a binary category. We draw attention to the potential problems that this can cause for already marginalised groups, and suggest some questions for researchers to ask themselves when undertaking work on speech and gender.
Acoustic models of Brazilian Portuguese Speech based on Neural Transformers
Gauy, Marcelo Matheus, Finger, Marcelo
An acoustic model, trained on a large amount of unlabeled data, is a self-supervised learned speech representation useful for solving downstream tasks, possibly after fine-tuning on the respective downstream task. In this work, we build an acoustic model of Brazilian Portuguese speech using a Transformer neural network. The model was pretrained on more than $800$ hours of Brazilian Portuguese speech, using a combination of pretraining techniques. Using a labeled dataset collected for the detection of respiratory insufficiency in Brazilian Portuguese speakers, we fine-tune the pretrained Transformer on the following tasks: respiratory insufficiency detection, gender recognition and age group classification. We compare the performance of pretrained Transformers on these tasks with that of Transformers without pretraining and observe a significant improvement. In particular, respiratory insufficiency detection achieves the best results reported so far, indicating this kind of acoustic model is a promising tool for speech-as-biomarker approaches. Moreover, gender recognition performance is comparable to state-of-the-art models in English.
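The fine-tuning setup described above can be sketched as pooling frame-level representations from the pretrained acoustic model and training a small classification head on top. The feature dimension, the random stand-in features, and the mean-pooling scheme below are illustrative assumptions, not the paper's configuration.

```python
import numpy as np

# Sketch: frame-level representations from a pretrained acoustic model are
# mean-pooled per utterance and fed to a small classification head. The
# random features stand in for real Transformer outputs; all shapes are
# illustrative assumptions.

rng = np.random.default_rng(0)

def pool_utterance(frames):
    """Mean-pool (n_frames, dim) frame features into one utterance vector."""
    return frames.mean(axis=0)

def head_logits(x, W, b):
    """Linear classification head, e.g. for gender recognition."""
    return x @ W + b

dim, n_classes = 16, 2
frames = rng.normal(size=(120, dim))   # one utterance, 120 frames
W = rng.normal(size=(dim, n_classes))
b = np.zeros(n_classes)

utt = pool_utterance(frames)
logits = head_logits(utt, W, b)
print(logits.shape)  # (2,)
```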
MiVOLO: Multi-input Transformer for Age and Gender Estimation
Kuprashevich, Maksim, Tolstykh, Irina
Age and gender recognition in the wild is a highly challenging task: apart from the variability of conditions, pose complexities, and varying image quality, there are cases where the face is partially or completely occluded. We present MiVOLO (Multi Input VOLO), a straightforward approach for age and gender estimation using the latest vision transformer. Our method integrates both tasks into a unified dual input/output model, leveraging not only facial information but also person image data. This improves the generalization ability of our model and enables it to deliver satisfactory results even when the face is not visible in the image. To evaluate our proposed model, we conduct experiments on four popular benchmarks and achieve state-of-the-art performance, while demonstrating real-time processing capabilities. Additionally, we introduce a novel benchmark based on images from the Open Images Dataset. The ground truth annotations for this benchmark have been meticulously generated by human annotators, resulting in highly accurate labels thanks to careful aggregation of votes. Furthermore, we compare our model's age recognition performance with human-level accuracy and demonstrate that it significantly outperforms humans across a majority of age ranges. Finally, we grant public access to our models, along with the code for validation and inference, and provide extra annotations for the datasets used in our new benchmark.
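The dual-input idea can be sketched as fusing a face embedding with a whole-person embedding, so that the person stream still carries signal when the face is occluded. The dimensions, simple concatenation, and zeroed face vector below are assumptions for illustration, not MiVOLO's actual fusion scheme.

```python
import numpy as np

# Sketch of the dual-input idea: fuse a face embedding with a whole-person
# embedding so a prediction is still possible when the face is occluded
# (represented here by a zeroed face vector). Dimensions and the plain
# concatenation are illustrative assumptions.

rng = np.random.default_rng(1)
dim = 8

face = rng.normal(size=dim)    # embedding from the face crop
person = rng.normal(size=dim)  # embedding from the whole-person image

def fuse(face_vec, person_vec):
    """Concatenate the two streams into one joint representation."""
    return np.concatenate([face_vec, person_vec])

joint = fuse(face, person)
joint_no_face = fuse(np.zeros(dim), person)  # occluded face: person stream remains
print(joint.shape, joint_no_face.shape)
```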
Manipulating Transfer Learning for Property Inference
Tian, Yulong, Suya, Fnu, Suri, Anshuman, Xu, Fengyuan, Evans, David
Transfer learning is a popular method for tuning pretrained (upstream) models for different downstream tasks using limited data and computational resources. We study how an adversary with control over an upstream model used in transfer learning can conduct property inference attacks on a victim's tuned downstream model, for example to infer the presence of images of a specific individual in the downstream training set. We demonstrate attacks in which an adversary can manipulate the upstream model to conduct highly effective and specific property inference attacks (AUC score $> 0.9$) without incurring significant performance loss on the main task. The main idea of the manipulation is to make the upstream model generate activations (intermediate features) with different distributions for samples with and without a target property, enabling the adversary to easily distinguish between downstream models trained with and without training examples that have the target property. Our code is available at https://github.com/yulongt23/Transfer-Inference.
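The core of the manipulation can be illustrated with a toy simulation: if the upstream model emits activations whose distribution differs for samples with and without the target property, a simple score separates the two cases with high AUC. The one-dimensional Gaussian activations below are an illustrative assumption, not the paper's actual attack.

```python
import numpy as np

# Toy simulation: an (adversarially manipulated) upstream model is assumed
# to emit activations with different distributions for samples with vs.
# without the target property. A rank statistic then measures how easily
# the two are separated. The 1-D Gaussians are illustrative assumptions.

rng = np.random.default_rng(42)

with_prop = rng.normal(loc=3.0, scale=1.0, size=1000)     # property present
without_prop = rng.normal(loc=0.0, scale=1.0, size=1000)  # property absent

def auc(pos, neg):
    """Probability that a positive score outranks a negative one (AUC)."""
    diffs = pos[:, None] - neg[None, :]
    return (diffs > 0).mean() + 0.5 * (diffs == 0).mean()

score = auc(with_prop, without_prop)
print(round(score, 3))  # well above the paper's 0.9 threshold at this separation
```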
SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Kang, Zuheng, Wang, Jianzong, Peng, Junqing, Xiao, Jing
Estimating age from a single utterance is a classic and challenging topic. Although Label Distribution Learning (LDL) can represent adjacent, indistinguishable ages well, the uncertainty of the age estimate for each utterance varies from person to person, i.e., the variance of the age distribution differs. To address this issue, we propose a selective variance label distribution learning (SVLDL) method to adapt the variance of different age distributions. Furthermore, the model uses WavLM as the speech feature extractor and adds the auxiliary task of gender recognition to further improve performance. Two tricks are applied to the loss function to enhance the robustness of the age estimation and improve the quality of the fitted age distribution. Extensive experiments show that the model achieves state-of-the-art performance on all aspects of the NIST SRE08-10 and a real-world dataset.
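The label-distribution idea can be sketched as replacing a one-hot age label with a discretised Gaussian over adjacent ages, whose variance differs per speaker (the quantity SVLDL adapts). The age range and the two example variances below are illustrative assumptions.

```python
import numpy as np

# Sketch of a label-distribution target for age estimation: the label is a
# discretised Gaussian centred on the true age rather than a one-hot
# vector. Per-speaker variance differs, which is what SVLDL adapts.
# Age range and sigmas are illustrative.

ages = np.arange(0, 101)

def age_label_distribution(true_age, sigma):
    """Gaussian over adjacent ages, normalised to sum to 1."""
    p = np.exp(-0.5 * ((ages - true_age) / sigma) ** 2)
    return p / p.sum()

narrow = age_label_distribution(30, sigma=2.0)  # low-uncertainty speaker
wide = age_label_distribution(30, sigma=6.0)    # high-uncertainty speaker

print(ages[np.argmax(narrow)], round(narrow.sum(), 6))  # peak at 30, sums to 1
```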
Is your smile male or female? Mapping the dynamics of a smile to enable gender recognition
Although automatic gender recognition is already available, existing methods use static images and compare fixed facial features. The new research, by the University of Bradford, is the first to use the dynamic movement of the smile to automatically distinguish between men and women. Led by Professor Hassan Ugail, the team mapped 49 landmarks on the face, mainly around the eyes, mouth and down the nose. They used these to assess how the face changes as we smile as a result of the underlying muscle movements, including both changes in distances between the different points and the 'flow' of the smile: how much, how far and how fast the different points on the face moved as the smile was formed. They then tested whether there were noticeable differences between men and women, and found that there were, with women's smiles being more expansive.
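The dynamic features described above can be sketched as tracking landmark positions across video frames and measuring how far and how fast each point moves as the smile forms. The synthetic trajectories below are stand-ins; the 49-landmark count follows the article, but the exact feature definitions are assumptions.

```python
import numpy as np

# Sketch of dynamic-smile features: per-landmark total displacement and
# mean per-frame speed over a sequence of frames. The synthetic linear
# trajectories are illustrative stand-ins for tracked landmarks.

rng = np.random.default_rng(7)
n_landmarks, n_frames = 49, 30

# Synthetic (x, y) trajectories: start positions plus per-frame drift.
start = rng.normal(size=(n_landmarks, 2))
drift = rng.normal(scale=0.05, size=(n_landmarks, 2))
frames = start[None] + drift[None] * np.arange(n_frames)[:, None, None]

def smile_features(traj):
    """Per-landmark total displacement and mean per-frame speed."""
    steps = np.linalg.norm(np.diff(traj, axis=0), axis=2)  # (frames-1, landmarks)
    total_displacement = np.linalg.norm(traj[-1] - traj[0], axis=1)
    mean_speed = steps.mean(axis=0)
    return total_displacement, mean_speed

disp, speed = smile_features(frames)
print(disp.shape, speed.shape)  # one value per landmark
```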
AI can now tell if you're a man or a woman, just by your smile
Men and women have different patterns of smiling, new research reports, and this, the authors add, can allow AI to easily distinguish between the genders. Many a man has been enraptured by the right smile, and many more will probably follow, although the opposite doesn't seem to hold true. Regardless, while romance unfolds across the world, one team of researchers from the University of Bradford is working to bring this subtle yet powerful gesture to bear in our interactions with artificial intelligence (AI). According to them, computers can learn to differentiate between men and women simply by observing a smile. Led by Professor Hassan Ugail, the team mapped 49 distinct points (or 'landmarks') on smiling human faces, mainly around the eyes, mouth, and down the nose.
Understanding and Comparing Deep Neural Networks for Age and Gender Classification
Lapuschkin, Sebastian, Binder, Alexander, Müller, Klaus-Robert, Samek, Wojciech
Recently, deep neural networks have demonstrated excellent performance in recognizing age and gender from human face images. However, these models were applied in a black-box manner, with no information provided about which facial features are actually used for prediction or how those features depend on image preprocessing, model initialization and architecture choice. We present a study investigating these different effects. In detail, our work compares four popular neural network architectures, studies the effect of pretraining, evaluates the robustness of the considered alignment preprocessing methods via cross-method test set swapping, and intuitively visualizes the model's prediction strategies under the given preprocessing conditions using the recent Layer-wise Relevance Propagation (LRP) algorithm. Our evaluations on the challenging Adience benchmark show that suitable parameter initialization leads to a holistic perception of the input, compensating for artefactual data representations. With a combination of simple preprocessing steps, we reach state-of-the-art performance in gender recognition.
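Layer-wise Relevance Propagation can be sketched with its epsilon rule on a tiny ReLU network: the output score is redistributed backwards through each linear layer in proportion to each input's contribution. The random weights below are stand-ins; real LRP for the paper's CNNs applies the same rule layer by layer, including to convolutions.

```python
import numpy as np

# Minimal sketch of Layer-wise Relevance Propagation (epsilon rule) on a
# tiny two-layer ReLU network with random stand-in weights. The output
# score is redistributed to the inputs; with no biases, total relevance
# is (approximately) conserved at each layer.

rng = np.random.default_rng(3)
eps = 1e-6

def lrp_linear(a, W, R_out):
    """Redistribute relevance R_out through a linear layer (epsilon rule)."""
    z = a @ W                           # pre-activations of the next layer
    s = R_out / (z + eps * np.sign(z))  # stabilised relevance ratios
    return a * (W @ s)                  # relevance for this layer's inputs

x = rng.normal(size=4)
W1 = rng.normal(size=(4, 5))
W2 = rng.normal(size=(5, 1))

h = np.maximum(0.0, x @ W1)   # hidden ReLU activations
y = h @ W2                    # network output (the score to explain)

R_h = lrp_linear(h, W2, y)    # relevance at the hidden layer
R_x = lrp_linear(x, W1, R_h)  # relevance at the input features

print(np.allclose(R_x.sum(), y.sum(), atol=1e-4))  # conservation check
```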
Gender Recognition Based on Sift Features
Yousefi, Sahar, Zahedi, Morteza
This paper proposes a robust approach for face detection and gender classification in color images. Previous research on gender recognition assumes a computationally expensive and time-consuming pre-processing step for alignment, in which face images are aligned so that facial landmarks such as the eyes, nose, lips and chin are placed at uniform locations in the image. In this paper, a novel technique based on mathematical analysis is presented in three stages that eliminates the alignment step. First, a new color-based face detection method is presented, with better results and more robustness against complex backgrounds. Next, features that are invariant to affine transformations are extracted from each face using the scale-invariant feature transform (SIFT) method. To evaluate the performance of the proposed algorithm, experiments were conducted using an SVM classifier on a database of 500 face images of distinct people with an equal ratio of male and female.